N94- 32467 



A VISUAL DETECTION MODEL FOR DCT COEFFICIENT QUANTIZATION 


\ 



Albert J. Ahumada, Jr. 

Andrew B. Watson 
NASA Ames Research Center 
Moffett Field, California 94035-1000 
ahumada@vision.arc.nasa.gov 
beau@vision.arc.nasa.gov 


ABSTRACT 

The discrete cosine transform (DCT) is widely used in image compression, and is part of the JPEG and MPEG 
compression standards. The degree of compression, and the amount of distortion in the decompressed image are 
controlled by the quantization of the transform coefficients. The standards do not specify how the DCT coefficients 
should be quantized. Our approach is to set the quantization level for each coefficient so that the quantization error is 
near the threshold of visibility. Here we combine results from our previous work to form our current best detection 
model for DCT coefficient quantization noise. This model predicts sensitivity as a function of display parameters, 
enabling quantization matrices to be designed for display situations varying in luminance, veiling light, and spatial 
frequency related conditions (pixel size, viewing distance, and aspect ratio). It also allows arbitrary color space 
directions for the representation of color. In a further development, we have developed a model-based method of 
optimizing the quantization matrix for an individual image. The model described above provides visual thresholds for 
each DCT frequency. These thresholds are adjusted within each block for visual light adaptation and contrast 
masking. For a given quantization matrix, the DCT quantization errors are scaled by the adjusted thresholds to yield 
perceptual errors. These errors are pooled non-linearly over the image to yield total perceptual error. With this model 
we may estimate the quantization matrix for a particular image that yields minimum bit rate for a given total 
perceptual error, or minimum perceptual error for a given bit rate. Custom matrices for a number of images show 
clear improvement over image-independent matrices. Custom matrices are compatible with the JPEG standard, which 
requires transmission of the quantization matrix. 

1. INTRODUCTION 


1.1 DCT image compression 


The discrete cosine transform (DCT) has become an image compression standard (ref. 1, 2, 3) Typically the 
image is divided into 8x8-pixel blocks, which are each transformed into 64 DCT coefficients. The DCT transform 
coefficients I m n , of an N x N block of image pixels /. k , are given by 


N - 1 N - 1 

K,n = X X h.k C J.m C k,n «,« = 0. . . W - 1 

j = 0 k-0 

where 

C J.m = C0S(— [2; + 1]), 

and 

0C m = m = 0 

= ^J2^N m> 0 


(la) 


(lb) 


(lc) 
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The block of image pixels is reconstructed by the inverse transform: 


N-l N- 1 

b.k = X X ^ C j,m Ck,n j,k=0...N~l 

m=0 n=0 


(2) 


which for this normalization is the same as the forward transform. Quantization of the DCT coefficients achieves 
image compression, but it also generates distortion in the decompressed image. If a single coefficient is quantized 
and its block is reconstructed, the difference between the original image block and the reconstructed block is the 
error image. This error image has the form of the associated basis function, and its amplitude is proportional to the 
quantization error of the coefficient. Since the inverse transform is linear, the error image resulting from quantizing 
multiple coefficients is a sum of such images. 

1.2 The Quantization Matrix 


The JPEG compression standard (ref. 1, 2) requires that uniform quantization be used for the DCT coefficients, 
but the quantizer step size to be used for each coefficient is left to the user. The step size used for coefficient I m rt is 
denoted by Qm t n • A coefficient is quantized by the operation 

S m ,n = R ound[/ mn /&.,,] (3a) 

and restored (with the quantization error) by 

Qm y n' (3b) 


Two example quantization matrices can be found in the JPEG standard (ref. 2). These matrices appear in Table 
1 following the references. These matrices were designed for a particular viewing situation. No suggestions were 
provided for how they should be changed to accommodate different viewing conditions, or for compression in a 
different color space. Our research was initiated to provide quantization matrices suitable for compression in the RGB 
color representation (ref. 4). Subsequently, a theoretical framework was constructed and additional measurements have 
been done (ref. 5, 6, 7, 8). Here we summarize the quantization matrix design technique. It can be applied under a 
wide variety of conditions: different display luminances, veiling luminances, spatial frequencies, and color spaces. 
The basic idea of the technique is to develop a detection model that predicts the detectability of the artifacts in a 
perceptual space representation. This step is described in Section 2. A quantizer step size is then determined from the 
sensitivity of the perceptual space representation to the quantization distortion. This step is described in Section 3. 

2. THE DETECTION MODEL 


2.1 The Luminance Detection Model 


The luminance detection model predicts the threshold of detection of the luminance error image generated by 
quantization of a single DCT coefficient I m n . We use the subscript Y for luminance, since we assume that it is 
defined by the 1931 CIE standard (ref. 9). This error image is assumed to be below the threshold of visibility if its 
zero-to-peak luminance is less than a threshold T m n given by 
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(4) 


lOg Ty mH — P(f m „',by,ky,fy) 

= ^gby if L.n<=fy 

= log by + k Y (log f mn - log f Y f if f mn > f Y 

This function P represents a low-pass contrast sensitivity function of spatial frequency. Although luminance 
contrast sensitivity is more correctly modeled as band-pass, we choose a low-pass function for this application. This 
ensures that no new artifacts become visible as viewing distance increases. A low-pass function is also convenient 
because purely chromatic channels are low-pass in this spatial frequency range. We will use P for the luminance and 
chrominance channels of our model. 

The spatial frequency, f m H , associated with the m,nth basis function, is given by 

(5, 

where W x and W y are the horizontal and vertical pixel spacings in degrees of visual angle. The term by has three 
components: 


by = 


S Ty 

~e~ 


( 6 ) 


The parameter s is a fraction, 0<s<l, to account for spatial summation of quantization errors over blocks. We 
set it to unity to model detection experiments with only one block (ref. 5) . Our summation results suggest that it 
should be equal to the inverse of the fourth root of the number of blocks contributing to detection (ref. 6). We 
suggest the value s=0.25, corresponding to 16x16 blocks. The factor T Y gives the dependence of the threshold on the 
image average luminance Y . 



Y 


So 


y<y t 
y >y t 


where suggested parameter values are Y T = 15cd/m 2 , 5 0 =40, and <^=0.65. 


(7) 


The product of a cosine in the x with a cosine in the y direction can be expressed as the sum of two cosines 
of the same radial spatial frequency but differing in orientation. The factor 


=r + (l-r) 


1- 


2 fmfl /o, n 
J m. 


-|2\ 


-I ) 


( 8 ) 


accounts for the imperfect summation of two such frequency components as a function of the angle between 
them. Based on the fourth power summation rule for the two components when they are orthogonal, r is set to 0.6. 
An additional oblique effect can be included by decreasing the value of r. 

The parameters f y and ky determine the shape of P and depend on the average luminance Y . 
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( 9 ) 


fy =f 0 Y a 'Y f -<*r, Y <LY f 

= /o. Y>Y f 


k y =^y a *yf y <y* 

=*o y>y* 

where 

/ 0 =6. 8 cycles /deg 
a, =0.182 

Y f = 300cd/m 2 

A o=2 

a* =0.0706, and 
y* =300cd/m 2 . 

2.2 The Chrominance Detection Model 


00 ) 


We now add two chromatic channels to the luminance-only model. From the large number of color spaces that 
have been proposed for chromatic discriminations, we have selected one close to that suggested by Boynton (ref. 9): 
a red-green opponent channel and a blue channel. Our channels are defined in terms of the CIE 193 1 XYZ color 
space. The blue channel is just Z, and the opponent red-green channel O is given by 

0 = 0.47 X-0.37y-0.10Z. (ID 

This opponent channel is approximately the Boynton (ref. 9) (Red-cone)-2(Green-cone) channel. Our model 
now needs the threshold for quantization noise in the O and Z channels. For simplicity, we model the chromatic 
thresholds by 


log T 0 „ = Hf^i^lL,k r A), 

B_ . 4 


( 12 ) 


and 

These shapes of these are in agreement with experimental results of Mullen (ref. 10), except that the slopes 
for the chromatic channels are found to be steeper than that of the luminance channel. The reason for keeping them 
the same is to prevent strong quantization of purely chromatic channels, since there is a fair amount of individual 
variability in the exact direction of isoluminance. 
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Although we previously used Z to set the level of the Z threshold (ref. 7), we are using Y here under the 
assumption that the average image color is close to white and hence that they are roughly equal. 

Finally, we say that the errors from the quantization of a coefficient are visible, if the error in any of the three 
channels is visible. 


3. QUANTIZATION MATRIX DESIGN 


Suppose that one color dimension D in a color space linearly related to our YOZ color space is to be 
quantized. Let Dy \ Dq and Dz be the amplitudes of the errors in YOZ space generated by a unit error in D. An error 
generated in the D image by quantizing the m,nth DCT coefficient is then below threshold if it is less than 


T D . m ,„ = min 


Y,m,n 1 Q,m,n 1 Z,m,n 


Dy 


D n 


D 


z J 


(14) 


The D quantization matrix entries are obtained by dividing the thresholds above by the DCT normalization 
constants ( GC m in Equation (lc) ): 




2T, 


D,m,n 


aa m 


(15) 


The factor 2 results from the maximum quantization error being half the quantizer step size. 
3.1 Quantization in YC r Cb Space 


In an attempt to put all luminance information in a single channel, color images are often represented in the 
YCfCb color space for image compression, (ref. 2) give the transformation from RGB to YCrCb as 

r = 0.3/? + 0.6G + 0.1B, 

C r = (R-Y ) / 1.6 + 0.5 , (16) 

C b = (fi-F)/ 2 + 0.5. 

Suppose that the viewing conditions are set so that the average image luminance is 40 cd/rn^, the pixel 
spacings are 0.028 deg, and the monitor calibration of the XYZ outputs for unit RGB inputs are given by the 
matrix 



X 

Y 

Z 

R 

26.1 

13.3 

2.3 

G 

25.2 

48.9 

10.2 

B 

9.3 

4.7 

35.7 


The values of Dy y Dq> and Dz for each dimension turn out to be: 
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( 18 ) 



Dy 

Do 

D z 

Y 

66.9 

- 1.1 

48.2 

c r 

- 17.8 

17.1 

- 4.5 

c b 

- 7.0 

0.6 

67.9 


The quantization matrices appear in Table 2 following the references. 

4. SUMMARY 

We have presented a model for predicting visibility thresholds for DCT coefficient quantization error, from 
which quantization matrices for use in DCT- based compression can be designed. We regard this as preliminary 
results of work in progress. The quantization matrices computed by the techniques described above take no account of 
image content We now show how an extension of this model may be used to optimize quantization matrices for 
individual images or a class of images. 

5- LIMITATIONS OF THE IMAGE-INDEPENDENT APPROACH 

The preceding approach constructs a quantization matrix independent of the image. While a great advance over 
the ad hoc matrices that preceded it the image-independent approach has several shortcomings. The fundamental 
drawback is that visual thresholds for artifacts are dependent on the image upon which they are superimposed. 

First visual thresholds increase with background luminance., and variations in local mean luminance within 
the image will in fact produce substantial variations in DCT threshold. We call this luminance masking. Second, 
threshold for a visual pattern is typically reduced in the presence of other patterns, particularly those of similar 
spatial frequency and orientation, a phenomenon usually called contrast masking. This means that threshold error in a 
particular DCT coefficient in a particular block of the image will be a function of the value of that coefficient in the 
original image. Third, the image-independent approach ensures that any single eiror is below threshold. But in a 
typical image there are many errors, of varying magnitudes. The visibility of this error ensemble is not generally 
equal to the visibility of the largest error, but reflects a pooling of errors, over both frequencies and blocks of the 
image. We call this error pooling. Fourth, when all errors are kept below a perceptual threshold a certain bit rate will 
result The image-independent method gives no guidance on what to do when a lower bit rate is desired. The ad hoc 
"quality factors" employed in some JPEG implementations, which usually do no more than multiply the 
quantization matrix by a scalar, will allow an arbitrary bit rate, but do not guarantee (or even suggest) optimum 
quality at that bit rate. We call this the problem of selectable quality. 

6. IMAGE-DEPENDENT APPROACH 

Here we present a general method of designing a custom quantization matrix tailored to a particular image. 

This unage-dependent method incorporates solutions to each of the problems described above. The strategy is to 
develop a very simple model of perceptual error, based upon DCT coefficients, and to iteratively estimate the 
quantization matrix which yields a designated perceptual error or bit-rate. We call this the DCTune algorithm, 
because it tunes the DCT quantization matrix to the individual image(ref. 11,12). 

6.1. JPEG DCT Quantization 

In the JPEG image compression standard, the image is first divided into blocks of size {8,8}. Each block is 
transformed into its DCT, which we write I m n bJ where m f n indexes the DCT frequency (or basis function), and b 
indexes a block of the image. Each block is then quantized by dividing it, coefficient by coefficient, by a 
quantization matrix (QM) Q m n , and rounding to the nearest integer 

'S’m.n.i = ROUnd (19) 
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The quantization error in the DCT domain is then 


E m ,n,b ” I m,n,b ^ m ,n,b Qm t n 

6.2 Luminance Masking 


(20) 


Detection threshold for a luminance pattern typically depends upon the mean luminance of the local image 
region: the brighter the background, the higher the luminance threshold (ref. 13, 14). This is usually called "light 
adaptation," but here we call it "luminance masking" to emphasize the similarity to contrast masking, discussed in 
the next section. We can compute a luminance-masked threshold matrix for each block in either of two ways. The 
first is to make use of a formula such as that supplied by Peterson ex al. (ref. 7) 

T m.n.b = apw[m,M,F/ 00fc /7 00 ] (21) 

where /(x^is the DC coefficient of the DCT for block b , Y is the mean luminance of the display, and 7 00 is the 
DC coefficient corresponding to Y (1024 for an 8 bit image). 

A second, simpler solution is to approximate the dependence of T m n fc upon I^ h with a power function: 
T m ,n,b = ^m,n(^00 b/^0o) (22) 


The initial calculation of T m n should be made assuming a display luminance of Y . The parameter Oj takes 
its name from the corresponding parameter in the formula of Ahumada and Peterson, wherein they suggest a value of 
0.65. Note that luminance masking may be suppressed by setting Oj= 0. More generally, Oj controls the degree to 
which this masking occurs. Note also that the power function makes it easy to incorporate a non-unity display 
Gamma, by multiplying dj by the Gamma exponent. 

6.3 Contrast Masking 


Contrast masking refers to the reduction in the visibility of one image component by the presence of another. 
Here we consider only masking within a block and a particular DCT coefficient We employ a model of visual 
masking that has been widely used in vision models, (ref. 15, 16). Given a DCT coefficient I m n b and a 
corresponding absolute threshold T m n b our masking rule states that the masked threshold M m nb will be 


Af. 


m,n,b 


= maxi 


1 • L,. t /T, 


m,n,b I 


(23) 


where w m „ is an exponent that lies between 0 and 1 . Because the exponent may differ for each frequency, we allow a 
matrix of exponents equal in size to the DCT. Note that when w m „ =0, no masking occurs, and the threshold is 
constant at T m n b . When w m n = 1, we have what is usually called "Weber Law” behavior, and threshold is 
constant in log or percentage terms (for I mnb > T mnb ). Because the effect of the DC coefficient upon thresholds 
has already been expressed by luminance masking, we specifically exclude the DC term from the contrast masking, 
by setting the value of w 00 = 0 . 

6.4 Perceptual Error and Just-Noticeable-Differences 

In vision science, we often express the magnitude of a signal in multiples of the threshold for that signal. 
These threshold units are often called "just-noticeable differences," or jrufs. Having computed a masked threshold 
mnb , the error DCT may therefore be expressed in jnd's as 

^m,n,b ^‘m.n.b/ ^m,n,b (24) 
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6.5 Spatial Error Pooling 

To pool the errors in the jnd DCT we employ another standard feature of current vision models: the so-called 
Minkowski metric. It often arises from an attempt to combine the separate probabilities that individual errors will be 
seen, in the scheme known as "probability summation" (ref. 17) . We pool the jnds for a particular frequency m,n 
over all blocks b as 



(25) 


In psychophysical experiments that examine summation over space a of about 4 has been observed (ref. 
17). The exponent fi s is given here as a scalar, but may be made a matrix equal in size to the QM to allow differing 
pooling behavior for different DCT frequencies. This matrix *F m „ is now a simple measure of the visibility of 
artifacts within each of the frequency bands defined by the DCT basis functions. We call it the "perceptual error 
matrix." 


6*6 Frequency Error Pooling 


This perceptual error matrix *Fm t n may itself be of value in revealing the frequencies that result in the 
greatest pooled error for a particular image and quantization matrix. But to optimize the matrix we would like a 
single-valued perceptual error metric. We obtain this by combining the elements in the perceptual error matrix, using 
a Minkowski metric with a possibly different exponent , ft j 





(26) 


It is now straightforward, at least conceptually, to optimize the quantization matrix to obtain minimum bit- 
rate for a given *¥ , or minimum for a given bit rate. In practice, however, a solution may be difficult to 
compute. Bu^if fiy =°°, then *¥ is given by the maximum of the *Fm,n . Under this condition minimum bit-rate 
for a given is achieved when all . 

6.7 Optimization Method 

Under the assumption j8y=°°, the joint optimization of the quantization matrix reduces to the vastly simpler 
separate optimization of the individual elements of the matrix. Each entry of the perceptual error matrix *Fm,n may 
be considered an independent monotonically increasing function of the corresponding entry Q m n of the quantization 
matrix. 

6.8 Optimizing QM for a given bit-rate 

To obtain a quantization matrix that yields a given bit rate with minimum perceptual error V 7 we note that 
the bit rate is a decreasing function of IP, and its is a simple matter to estimate the requisite perceptual error. 

7. APPLICATION TO SPACE IMAGERY 

Image compression will play a vital role in the distribution of preview images of science data to scientists at 
distributed sites, especially in programs such as EOS and the Mission to Planet Earth(ref. 18). Due to the generally 
high performance and wide availability of the JPEG still image compression standard, we expect it to play an 
important role in this area. Since the JPEG standard includes the quantization matrix as part of the file, DCTune 
technology provides a method of optimizing the bit-rate/quality trade-off for each science image. 
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Figure 1. Voyager image of Jupiter compressed to 1.0, 0.5, and 0.25 bits/pixel, using optimal DCTune 

quantization matrices. 

Lossy image compression based on the DCT may also play a role in the recovery of scientific imagery from 
spacecraft. The Galileo orbiter spacecraft is now on its way to Jupiter. Due to a malfunction of the main antenna, 
image data will be sent to earth over an auxiliary antenna with approximately 15,000 times lower bandwidth. Image 
compression will be used to partially compensate for the loss of bandwidth(ref. 19). In support of this effort, we 
have designed quantization matrices for use in the Galileo mission, based on application of DCTune technology to 
existing Voyager and Galileo images (ref. 20, 21) . An example of DCTune algorithm applied to an image of Jupiter 
obtained by the previous Voyager mission is shown in Fig. 1 . It shows the original and three levels of optimized 
compression: 1.0, 0.5, and 0.25 bits/pixel. In this example, the parameter values used were (ij = 0.65, /? = 4, 
w m. n = 07 > display mean luminance Y = 65 cd m' 2 , image greylevels = 256, = 1024. The viewing distance 

was assumed to yield 32 pixels/degree. 
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8. SUMMARY 


We have shown how to compute a visually optimal quantization matrix for a given image. These image- 
dependent quantization matrices produce better results than image independent matrices. The DCTune algorithm can 
be easily incorporated into JPEG-compliant applications. 

In a practical sense, the DCTune method proposed here solves two problems. The first is to provide 
maximum visual quality for a given bit rate. The second problem it solves is to provide the user with a sensible and 
meaningful quality scale for JPEG (or other DCT-based) compression. Without such a scale, each image must be 
repeatedly compressed, reconstructed, and evaluated by eye to fmd the desired level of visual quality. 
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16 

11 

10 

16 

24 

40 

51 

61 


12 

12 

14 

19 

26 

58 

60 

55 

luminance 

14 

13 

16 

24 

40 

57 

69 

56 

quantization 

14 

17 

22 

29 

51 

87 

80 

62 

matrix 

18 

22 

37 

56 

68 

109 

103 

77 


24 

35 

55 

64 

81 

104 

113 

92 


49 

64 

78 

87 

103 

121 

120 

101 


72 

92 

95 

98 

112 

100 

103 

99 


17 

18 

24 

47 

99 

99 

99 

99 


18 

21 

26 

66 

99 

99 

99 

99 

chrominance 

24 

26 

56 

99 

99 

99 

99 

99 

quantization 

47 

66 

99 

99 

99 

99 

99 

99 

matrix 

99 

99 

99 

99 

99 

99 

99 

99 


99 

99 

99 

99 

99 

99 

99 

99 


99 

99 

99 

99 

99 

99 

99 

99 


99 

99 

99 

99 

99 

99 

99 

99 


Table 1 The default quantization matrices. The Qoo value is located in the upper left owner of each matrix. 
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11 

12 

15 

19 

25 

32 


11 

13 

10 

10 

12 

15 

19 

24 

Y' 

11 

10 

14 

14 

16 

18 

22 

27 

quantization 

12 

10 

14 

18 

21 

24 

28 

33 

matrix 

15 

12 

16 

21 

26 

31 

36 
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19 

15 

18 

24 

31 

38 

45 

53 


25 

19 

22 

28 

36 

45 

55 

65 


32 

24 

27 

33 

42 

53 

65 

77 



21 

21 

41 

45 

55 

71 

92 

120 


21 

37 

39 

38 

44 

55 

70 

89 

Cr 

41 

39 

51 

54 

59 

69 

83 

103 

quantization 

45 

38 

54 

69 

80 

91 

106 

126 

matrix 

55 

44 

59 

80 

100 

117 

136 

158 


71 

55 

69 

91 

117 

144 

170 

198 


92 

70 

83 

106 

136 

170 

206 

243 


120 

89 

103 

126 

158 

198 

243 

290 



45 

43 

103 

114 

141 

181 

236 

306 


43 

78 

99 

97 

113 

140 

178 

228 

c b 

103 

99 

130 

138 

150 

175 

212 

262 

quantization 

114 

97 

138 

176 

203 

232 

270 

321 

matrix 

141 

113 

150 

203 

254 

299 

347 

403 


181 

140 

175 

232 

299 

367 

434 

505 


236 

178 

212 

270 

347 

434 

525 

619 


306 

228 

262 

321 

403 

505 

619 

739 


Table 2. YCrCb quantization matrices. The values in these matrices are obtained following the proceduredescribed in 
Section 3. The Qo,0 value is located in the upper left comer of each quantization matrix. As specified in the JPEG 
standard, the values have been rounded to the nearest integer. JPEG also requires that values in the quantization 

matrix be < 255. 
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ABSTRACT 


The use of XTP and FDDI provides a high speed and high performance network solution to multimedia 
transmission that requires high bandwidth. FDDI is an ANSI and ISO standard for a MAC and Physical layer 
protocol that provides a signaling rate of 100 Mbits/sec and fault tolerance. XTP is a Transport and Network layer 
protocol designed for high performance and efficiency and is the heart of the SAFENET Lightweight Suite for 
systems that require high performance or realtime communications. Our testbed consists of several commercially 
available Intel based i486 PCs containing off-the-shelf FDDI cards, audio analog-digital converter cards, video 
interface cards, and XTP software. Unicast, multicast, and duplex audio transmission experiments have been 
performed using XTP and FDDI. We are working on unicast and multicast video transmission. Several potential 
commercial applications are described. 


INTRODUCTION 

Multimedia (voice, video, data, text, and graphics) distribution over high speed networks has many 
commercial applications which will revolutionize the way we use computers and networks. Several big 
corporations have already formed strategic alliances to explore new opportunities in this area. 

We have been researching and experimenting for several years with high speed networks which utilize 
Fiber Distributed Data Interface (FDDI), and a high performance network protocol called Xpress Transfer 
Protocol (XTP). As multimedia increased in popularity, in both military and commercial world, we started to look 
at the possibility of using XTP and FDDI in voice and video transmission. We have performed many voice 
transmission experiments using XTP with several PC i486 machines connected via FDDI network. The results 
indicate voice transmission using XTP and FDDI have many advantages over traditional methods of voice 
transmission such as fault tolerance, high bandwidth, and data integration. Right now, we are performing video 
transmission experiments using XTP and FDDI. 


XTP 

The Xpress Transfer Protocol (XTP)[1] has been developed over the past seven years from a consortium 
of private industry, academia, and government to address many high performance and realtime issues that were 
lacking in previously developed transport and network protocols. Certain concepts from existing protocols 
(e.g.VMTP, GAM-T-103, Delta-t, NETBLT) were modified and combined with new ideas to form the basis for 
XTP. Experience and other ideas have added to its development to produce the current specification [2]. 

It is a protocol that spans the Network and Transport layers (layers 3 and 4) of the OSI 7 layer model and 
therefore has some interesting features due to the coupling of an end-to-end protocol with an intermediate, 
network protocol (e.g. bandwidth reservation of an intermediate resource by an end host or packet priority 
assigned by an end host and used by an intermediate router). Because the protocol does not specify policy but 
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